Estimation, Optimization, and Parallelism when Data is Sparse or Highly Varying
نویسندگان
چکیده
We study stochastic optimization problems when the data is sparse, which is in a sensedual to the current understanding of high-dimensional statistical learning and optimization.We highlight both the difficulties—in terms of increased sample complexity that sparse datanecessitates—and the potential benefits, in terms of allowing parallelism and asynchrony in thedesign of algorithms. Concretely, we derive matching upper and lower bounds on the minimaxrate for optimization and learning with sparse data, and we exhibit algorithms achieving theserates. We also show how leveraging sparsity leads to (still minimax optimal) parallel andasynchronous algorithms, providing experimental evidence complementing our theoretical resultson several medium to large-scale learning tasks.
منابع مشابه
Estimation, Optimization, and Parallelism when Data is Sparse
We study stochastic optimization problems when the data is sparse, which is in a sense dual to current perspectives on high-dimensional statistical learning and optimization. We highlight both the difficulties—in terms of increased sample complexity that sparse data necessitates—and the potential benefits, in terms of allowing parallelism and asynchrony in the design of algorithms. Concretely, ...
متن کاملRobust Estimation in Linear Regression with Molticollinearity and Sparse Models
One of the factors affecting the statistical analysis of the data is the presence of outliers. The methods which are not affected by the outliers are called robust methods. Robust regression methods are robust estimation methods of regression model parameters in the presence of outliers. Besides outliers, the linear dependency of regressor variables, which is called multicollinearity...
متن کاملLong-term Iran's inflation analysis using varying coefficient model
Varying coefficient Models are among the most important tools for discovering the dynamic patterns when a fixed pattern does not fit adequately well on the data, due to existing diverse temporal or local patterns. These models are natural extensions of classical parametric models that have achieved great popularity in data analysis with good interpretability.The high flexibility and interpretab...
متن کاملImproved Channel Estimation for DVB-T2 Systems by Utilizing Side Information on OFDM Sparse Channel Estimation
The second generation of digital video broadcasting (DVB-T2) standard utilizes orthogonal frequency division multiplexing (OFDM) system to reduce and to compensate the channel effects by utilizing its estimation. Since wireless channels are inherently sparse, it is possible to utilize sparse representation (SR) methods to estimate the channel. In addition to sparsity feature of the channel, the...
متن کاملStatistical Inference for Varying Coefficient Models
This dissertation contains two projects that are related to varying coefficient models. The traditional least squares based kernel estimates of the varying coefficient model will lose some efficiency when the error distribution is not normal. In the first project, we propose a novel adaptive estimation method that can adapt to different error distributions and provide an efficient EM algorithm ...
متن کامل